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Abstract 

fr^ I We study a one parameter family of random graph models that spans 

Q^ ■ a continuum between traditional random graphs of the Erdos-Renyi type, 

(-H I where there is no underlying structure, and percolation models, where the 

possible edges are dictated exactly by a geometry. We find that previously 
developed theories in the fields of random graphs and percolation have, start- 
ing from different directions, covered almost all the models described by our 
family. In particular, the existence or not of a phase transition where a giant 
cluster arises has been proved for all values of the parameter but one. We 
^ I prove that the single remaining case behaves like a random graph and has a 

single linearly sized cluster when the expected vertex degree is greater than 
one. 



^ ! 1 Random Graphs and Finite Percolation 

00 . ^ ^ 

f^ , The Q{n,p) family of random graphs, originally due to Erdos and Renyi, is con- 

structed by letting the vertex set y = [n] = {0, 1, . . . ,n — 1}, and letting every 
possible edge {n, v} belong to the edge set E independently with probability p. 
Bernoulli bond percolation models, on the other hand, are typically constructed 

5^ \ by starting with a finite degree lattice and retaining only the edges therein in the 

same manner, and not any others (retained edges are called open). 

The study of random graphs is thus the study of models without an under- 
lying geometry, whereas percolation models depend heavily on the geometry and 
structure of the lattice on which they are defined. A reasonable question is to ask 
what happens if one relaxes the infiuence of the structure in percolation models - 
for example by allowing open edges to form between vertices more then one step 
from each other in the lattice. Such models, known as long-range percolation, 
have been studied previously (see |10j pj and below for more references). 
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Likewise, starting from the other direction, one may ask what happens to 
Q{n,p) Hke graphs when structure is introduced - making some possible edges 
more hkely to appear than others. Results about this can be gleaned from re- 
cent generalized random graph models [1], and show how much structure can be 
introduced while keeping the behavior of the model more or less intact. 

In fact, in terms of the distance dependence of the edges, known models for 
long-range percolation and generalized random graphs come very close to covering 
the whole spectrum. We will discuss a family of random graph models with a single 
real parameter a that regulates the influence of an underlying structure. We will 
see that the cases when a < 1 fall in the category of previously analyzed random 
graphs, while the cases where a > 1 fall in the category of long-range percolation 
models. We present some connectivity results for the final, critical, case where 
Q = 1. 

1.1 Notation 

As is common, we will use G to denote both specific graph realizations and the 
random graphs, though we strive to make the difference clear by context. Where 
Q is a random graph family, G ^ Q means that G is distributed according to this 
family. Ci = Ci{G) will denote the biggest connected component of G. 

As usual, a series of events Ai,A2,--- occurring asymptotically almost surely 
(a.a.s.) means that lim„_>oo P(^n) = 1- 

1.2 Organization 

The paper is organized as follows. In Section [2] we introduce our "a-model" of 
random graphs, and in Section [3] we go through how the behavior of the model 
varies, discussing the known results for most values of a. Finally Section[4]contains 
our analytic contribution. 

2 The a-model 

A major difference between the analysis of random graphs and percolation models 
is whether it is done in a finite or infinite setting. Questions about percolation are 
typically asked about the behavior of clusters on a infinite grid - the most basic 
question being whether an infinite cluster remains open. Q{n,p) is, on the other 
hand, almost always studied for finite values of n - this for the simple reason that 
if n = oo and p > the graph is a.s. not locally finite. The typical approach is 
instead to scale the value of p with n - in particular, sparse random graphs are 
ones where p = c/n for some fixed c (meaning that expected degree is essentially 
constant for all n). The question is then, rather than asking whether an infinite 



cluster exists, to look at the relative size of the largest cluster (compared to n) as 
a function of c. 

We take the latter approach here, using a degree normalizer and studying 
finite graphs, but note that it largely intersects with normalizer free models in 
cases where the degree is already limited by the structure. We will also restrict 
ourselves to a one dimensional geometry. One dimension is not an interesting 
environment for standard Bernoulli percolation, but long-range percolation can 
be fruitful here. Our geometry is based around the following metric: 

d{u,v) = min(|n — v\,n — \u — v\). 

This is equivalent to placing the vertices in a ring and using the geodesic distance 
(see Figure d]) . 

Definition 2.1. (The a-model) For a G [0, oo] the family of random graphs 
Gain, c) are graphs G = (F, E), where V = [n] and for u,v ^V 

Pu,v=^{{u,v} GE) 



ha,nd{u,vY 

where ha,n = Y2u£V-u^o ^/d{u,0)°^ , independently for all disjoint {u,v}. 
For Q = this equivalent to G{n,p) with p = c/{n — 1). When a = oo 

_ fc/2 ifd{u,v) = l 
1 otherwise. 

which is standard percolation, for which an infinite cluster cannot exist if c < 2. 
Random graphs with edge probabilities given by a power-law of the distance 
are not new, and have appeared in more or less exactly this form elsewhere. See 
the pioneering work of Aizenman, Newman, and Schulman [TO] [1], the ideas of 
Kleinberg [8], as well as later work by other authors [2], [S]. 

3 Regimes of the a-model : The Emergence of Struc- 
ture 

3.1 a = : Q{n,p) Random Graph 

When a = 0, the distance between the points does not affect connectivity, and 
the a-model is exactly the same as G{n,p) with p = c/{n — 1). This case has, of 
course, been extensively studied, see j6j as well as the book length discussions in 
[7] [3]. With regard to connectivity, it is known to undergo a phase transition at 
c = 1: the largest connected cluster is of size 9{logn) in the subcritical phase, 
6{n?'^) in the critical phase, and 9{n) in the supercritical phase. 




Figure 1: A realization of the a-inodel Gain, c) with a = 1, n = 100, and c = 2. 

Theorem 3.1. (Erdos, Renyi) Let G ~ Q{n,p) for p = c/{n — 1), and p he 
the survival probability of a Galton-Watson branching process with Poisson{c) 
offspring distribution. Then 

\Ci\/n^p 



as n 



oo. 



Much more is known regarding the distribution sequence of component sizes, 
see the above books for details. 



3.2 < a < 1 : Essentially a Random Graph 

When a > the geographic structure of the model starts affecting the edges. 
However, as long as a < 1 much of the general behavior is retained. While the 
random graph results above cannot be directly applied, this regime falls within a 
more recent general random graph model of Bollobas, Janson, and Riordan [3j . 

In the BJR model Q{n, n), one is given a "ground space" S which we take to 
be [0, 1], and a Borel measurable kernel k : S x S t—i- M^. For a sequence of points 
(j;i, j;2> • • • ) Xn) of S corresponding the n vertices, edges are added independently 
between each pair with probability: 



Pt, 



min(K(x„,x„)/n, 1). 



The model contains a lot of freedoms which we will not require. For instance 
the sequence of points may be random, in which case the limiting distribution 
obviously matters greatly. We will not need this, and indeed may set Xu = u/n. 
We then let: 

c/\x — y|" for X 7^ y 

otherwise 



K{x,y) 



(1) 



where \x — y\ is again interpreted as circular distance, this time on [0, 1]. 

In order for the results of BJR to hold, k must adhere to certain conditions 
which the authors call being "graphical" (Definition 2.7 in [3]). The troublesome 
condition for our k is that it must belong to L^{S x S, fix jj,) (where fj, in our case 
is the Lebesque measure on [0, 1]). Clearly this is true only for a < 1. 

For this case however, the BJR model with the above kernel gives pij = 
c/{n^~°'d{i,j)°') which is asymptotically equivalent to the a model but for a 
slightly different value of c. BJR prove that G{n,K) behaves more or less like 
the classical Q(n,p) ~ in particular, most of the proofs are very similar barring 
the technical difficulties incurred by the greater generality. With regard to con- 
nectivity, the same result as above holds, that the phase transition at which a 
0(n) cluster emerges is at c = 1. 

Theorem 3.2. (Bollobas, Janson, Riordan) If G is a random graph of size n 
from the BJR model with kernel k, then 

\Ci\/n ^ p^ 

where p^ is the survival probability of a multitype Galton- Watson process with the 
offspring distribution of x given by a Poisson point process on S with intensity 
k{x,-). 

Because our geometries are transitive, this reduces to a single-type Galton- 
Watson process with Poisson offspring. It is thus well known that p > exactly 
when the expected number of offspring is greater than 1. 

3.3 a = l : The Small World 

When a = 1, the resulting model no longer falls within the that of BJR since the 
K implied by ([T]) is no longer integrable at 0. On the other hand, it does not fall 
within the long-range percolation models discussed below, this time since 1/x is 
not integrable at infinity. The connectivity of the resulting "in-between" model 
has to our knowledge not been studied elsewhere. 

This model is of added interest due to the results of Kleinberg regarding small- 
world models and further work that has followed, see [8] [9]. These show that the 
value of a is intimately related with possibility of decentralized routing (path- 
finding) in graphs, and that it is exactly when the relation between distance and 
edge prevalence is as at a = 1 that such routing will find short paths. 

In Section H] below, we prove that this case also undergoes at phase transition 
equivalent to the previous - when the expected degree of each vertex is greater 
than one, Ci has order n, otherwise it is sublinear. 



3.4 1 < a < 2: Long- Range Percolation 

When we move to a > 1 the model undergoes another radical change. In this 
case the normalizing constant ha,n -^ ha < oo as n ^ cxo. The model is thus 
very similar to the long-range percolation models studied in the 1980s within 
mathematical physics (see |10) and [1] for rigorous results). It was found that 
these models do undergo a phase transition similar to the regimes above: for c 
less than a certain value there is no percolation, while for large c it can be shown 
to occur. 

While the theorem is stated for percolation on Z in [TO], the arguments in the 
proofs use only finite subsets, and may be stated as 

Theorem 3.3. (Newman, Schulman) For 1 < a < 2 there exists c < oo and 
(j) > such that for Gain, c) 

\Ci\ >(j)n a.a.s. 

In all the regimes of the a-model discussed in Sections 13.11 - 13. 3^ the critical 
value of c was found to be one. This is not possible when q > 1. 

Proposition 3.4. For any a > 1, there exists an e > such that for c = 1 + e in 
Qain^c) |Ci|/n^O a.a.s. 

Proof. Let e = l/{&h^). For a given graph, let K{v) be the clustering number of 
a vertex v. That is 

K{v) = number of 3-cycles containing v 

For a given vertex v, let Y be the number of second order neighbors. 

y< Y. {\N{v)\-i)-K{v) 

which implies 

B[Y] <B[\N{v)m\N{u)\] -B[K{v)]. 

Since E[K{v)] > P{v ^ v + l,v ^ v + 2,v + I ^ v) = l/{2hak^), it holds 
that E[y] < 1. It follows that any two-step exploration process on the graph is 
dominated by a subcriticial Galton- Watson process. D 

3.5 a = 2 : The Second Critical Value 

The situation when a = 2 is also handled in [lOj and [Ij. Here it turns out that 
for the a-model, as we have defined it, there is no giant cluster unless c is large 
enough that Pu,u+i = 1 (in which case everything is of course trivially connected). 



However, the authors show that there are in fact distributions with this tail decay 
where you will get a giant cluster, however, it is necessary that: 

hm inf d°'Pu,u+d > 1 

d~*oo 

In our case, this implies that c = ha, which of course is the trivial case where 
Pi,i+i = 1- For more general long-range percolation formulations one can define 
non-trivial situations which do percolate with this tail. 

3.6 2 < a < oo : Essentially Percolation 

When a > 2, the model shows behavior similar to standard percolation. A way 
to see this is to consider a renormalization of the vertex space into large blocks. 
For c < /iq let i?i, -B2; • • • , -B„/m be contiguous blocks of vertices, each of size m. 
Let Xi^j be the number of edges between blocks i and j. 

oo ^ 

Since ^j,j+i is the sum of independent events each occurring with probability 
smaller than one, that it has bounded expectation means that 

P(Xi,,+i > 0) < p < 1 

where p is independent of n and m. On the other hand, if |i — j| > 1, then 

V{Xij > 0) < E[Xij] < h'/mf'-^ -^ 

as m ^ oo [h' is a constant). 

This means that if m is sufficiently large, and we view two blocks as connected 
if there is any edge between them, then the system of connected blocks will look 
more or less like standard Bernoulli percolation. 

Formally, let k = logn and m = (logn)^' ("^^^ Then the integral bounds give 
that for any vertex x 

P(a; is connected to at least mk vertices) < 

P(A; adjacent blocks are connected in either direction from x) 

+ P(one of those 2k blocks has a non-adjacent connection) < 1/logn. 

for n sufficiently large. It follows that E[^ vertices in clusters larger than mk] < 
n/logn whence P(|Ci|/n > p) ^ for any p > 0. 

3.7 a = oo : Percolation 

As noted above, at a = oo the a-model is exactly percolation with probability 
c/2 that each edge is open. 
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4 Analysis of case a = 1 

In this section, we prove a result similar to Theorems 13.11 and 13.21 for the "Small 
World" case where a = 1. 

To Qi{n, c) we associate a Galton- Watson branching process {•^"(c)}jgz+ where 
the child distribution is the same as the marginal distribution of each vertices de- 
gree in Gi{n, c). Let p„ = Pnic) be the survival probability of this process, which 
we know tends to if c < 1 and a positive value otherwise. 

By the "law of rare events", the distribution 5i converges to a Poisson(c) 
distribution. Before proceeding we prove that this implies that Pnic) —>■ p{c) as 
n -^ oo, where p{c) is the survival probability of GW process with Poisson(c) 
offspring. This continuity result isn't ne'wo and seems to be assumed by some 
works on random graphs, but since we need it explicitly at several points below, 
we include it here. 

Lemma 4.1. Let for each fc € N, {Z^^j^q be a Galton- Watson branching process 
with offspring given by a non- degenerate distribution with probability generating 
function fk and extinction probability q^ (so that q^ = f{qk))- 

If fk ^ f CLS n ^ oo pointwise, then qk — > q, the smallest value in [0, 1] for 
which f{q) = q- 

Proof. Recall that the probability generating functions involved are convex, and 
take the value 1 at 1. Choose a subsequence ki such that gfc. — > q. 

q^Qk, = fkMkd -^ Kq) 

all as n — > oo. It follows that ^ is a fixed point of /. If q = 1 then that is /'s only 
fixpoint in [0, 1], and it follows directly that q = q and the result is established. 
If g < 1, / now has two fixpoints in [0, 1] by convexity: 1 and q. We must rule 
out the case q = 1. 

Assume that qk^—>l. Let q < s < qk^, which implies that fkX^) > *■ Letting 
i ^ oo for all g < s < 1, f{s) > s. But then there cannot exists a fixpoint q < 1 
such that f{q) = q, which contradicts out assumption. D 

Theorem 4.2. //G ~ ^i(n,c) then 

\Ci{G)\ p 



n 



p as n ^ oo 



^Thanks to Peter Jagers for helping me with this proof. 



The proof largely follows the proof for Erdos-Renyi type graphs (see [7] , and 
also [4J. We follow the proof of Lemma 9.6 in the latter without significant 
deviation up to the proof of Claim 2 in the latter half). The difference is that, 
because of the clustering, the branching process coupling breaks down sooner. 
Therefore we need a different argument for why all "large" clusters are in fact the 
same. 

Proof. Choose c' and e such that 1 < c' < c and < e < 1 — 1/c'. 

We construct the graph using the following well known coupling: For every 
pair of vertices u and v, let Uu,v be a random variable uniformly distributed on 
[0, 1]. We add an edge between u and v if Uu,v < Pu,v 

Let G ~ Qi{n,c') constructed in this manner. Later we will increase the Puv 
by 

' (2) 

hi^nd{u,v) 

where 5 = c — d . The resulting graph is distributed the same as Qi{n,c). 
Note that 

log(n)</ii,„<21og(n) (3) 

Consider a standard exploration process on G starting at a vertex x. We ter- 
minate the exploration either when the explored set becomes larger than some 
function Lo{n) < n^ where u;(n) — s- oo as n — > cxd or when the exploration pro- 
cess dies. Following [4J (but with a modified definition) we call such functions 
admissible if for any 7 > 

Lo{n)/n^ -^ 

as n — > 00. Let the set B = B^ be set of x for which the process stopped for the 
former reason (B thus contains all the vertices in components larger than Lu{n)). 
Since a;(n) <C n"^, the exploration may be coupled between two branching 
processes. From above, we can couple it with Z"(c'), and from below by a similar 
process but where the offspring are given a random variable Y, the degree of x 
only counting neighbors more than a;(n) steps away (since the worst case is that 
we have already explored the a;(n) nearest vertices). Using ([3]) this gives 

B[Y]>c'(l-^-^^)>c'il-e). 
V logn J 

Let p' = p'{c') be the survival probability of this process. It then follows that, 

p' <P{xeB) <p{c)+o{l). 

By selecting e sufficiently small, we can make p' arbitrarily close to p{c') (Lemma 
14. ip . while the coupling still holds for n sufficiently large. Thus ^{x G S) — > p{c') 



-BlBl 
n 



as n ^ oo. Addition over all the vertices gives 

- P{c). (4) 

What remains is to show two things: 

1. For all admissible u{n), \B\/n -^ p{c') as n — > cxd. 

2. For some admissible uj{n), B consists of only one component. 

To prove the first claim, we note that the derivation of ^ did not depend 
on the choice of uj{n), and thus holds for all admissible functions. Given such a 
function, let uj'{n) be one strictly larger, and let B and B' be their respective sets 
of vertices in large components (note that B' C B). 



E\B\B'\ E\B\-E\B'\ 



0. 



n 



n 



(5) 



It follows that if \B\/n -^ p{c') holds for B, it must also hold for B', since 



\B'\ 



n 



> ei 



< 



P 
P 

+ P 



\B\ 


-\B\B'\ „ 


\B\ 
n 


n 
- P 


> ei/2) 



> ei 



\B\B'\ 



n 



>ei/2 







The second term of the convergence follows by ([5]) and the first moment method. 
Now let cv{n) < loglog(n). We will show the claim for this uj{n), and use 
the previous result to establish it for any faster growing uj{n). We now explore 
from two vertices x and y. Start the exploration from x first. At the end of this, 
we have found a connected subset C{x) of vertices around x. Because both the 
expected degree of each vertex and the variance is constant, a Chebyshev bound 
shows that the probability that we should encounter a vertex with more than uj{n) 
neighbors is o(l). Thus we can assume that |C(x)[ < 2w(n). 

P(y G C{x)) < M!^ = o{i) 

logn 

since at each step of the exploration, the probability that any vertex which is not 
y is connected to it is less than 1/2 log n by ([3]). Next we explore from y and until 
we have constructed a C{y). In each step of the exploration, the probability we 
draw a vertex in C{x) next is bounded from above by log log logn/ logn, so 

V{C{x)r^C{y) / 0) < 2 log log n ^"^!"^^"^"" + o(l) = o(l). 

logn 



10 



It follows that: 

PP - o(l) < Pix, y£B)< pic)pic') + 0(1) 
whence P{x,y € i?) = p'^ic') and 



lE[|Sp]-p2 



as n — > oo. This means that Var(|i3|/n) -^ and thus \B\/n -^ p(c'). The first 
claim is thus established. 

For the second claim, we will add the additional edges that we withheld in the 
beginning by increasing the threshold for edge existence by ([2]), and show that 
this connects all large clusters. We start by letting Lo{n) = log (n) and B be as 
before. We condition on the graph G constructed with the c' threshold, which we 
may assume has \B\ > {p — £2)^ (for €2 arbitrarily small) by the above. From G 
we create the graph G'^ by completing all the connected clusters of G - note that 
while this adds edges, it does not change the connectivity properties of the graph. 
In particular, if adding the additional edges makes B a connected component in 
G^, it does so also in G. 

Now select from B as many subsets Ki, K2, ■ ■ ■ , Km as possible, such that 
each Ki is a clique in G'^, and each \Ki\ = log (n). Since B consists only of 
connected clusters of size at least log^(n) in G'^, we can select these so that m = 
{p-e2-o{l))n/log^{n). 

Consider now the graph H, created by taking the Ki as vertices, as connecting 
Ki and Kj if a new edge is created between any two constituent vertices when 
the 6n edges are added. Since for any vertices x and y, d{x, y) < n/2 

P(x and y are connected by the new edges) > — 

2n log n 

It follows that the number of connections created between Ki and Kj dominates 
a random variable X which is Bin(log (n), (^/2nlog n) distributed. From a simple 
second moment estimate, one gets 

6log^{n) /(51og^(n)\ log^(n) 



P(X > 0) > ^ , , ^ , 

4nlogn \ 4 / n 

It follows that H is dominated by a graph of the form 

n / 6log^{n)\ log^(n) 



g (1-62 -0(1))- 



log^(n) ' V 4 J n 

of the standard Erdos-Renyi Q{n,p) family. But the threshold for Q{n,p) being 
completed connected a.a.s. is p ^ logn/n, which holds here. Thus H is a.a.s. 
connected, from which it follows that B is a.a.s. connected in the completed 
graph. This establishes the result. D 
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5 Generalisation and a Conjecture 

The result in Section U] shows that in the a-model, a < 1 gives a connectivity 
phase transition similar to classical random graphs, while a > 1 does not. It 
is of interest to understand more precisely when this transition occurs. That is, 
for more general Pu,v when does having a mean vertex degree greater than one 
sufficient for a giant component to emerge? Put more philosophically: when is a 
random graph a random graph, and when is it percolation? 

A generalization of the a-model in Definition 12.11 is given by letting 

„/r 1 ^N cf(d(u,v)) ,„. 

Pu,, = P({n, v}eE)= ^\^ " (6) 

rif,n 

where / : N — > M is some decreasing function, and 

h,n= Yl f(d{0,u)). (7) 

G°'{n,c) is thus given by letting f{x) = l/x". The question then becomes for 
which / functions c = 1 is the critical value above which a giant component 
emerges. 

Proposition 13.41 directlv provides a necessary requirement, namely that hf^ ~^ 
oo as n — > CO. Thus for instance f{x) = l/(xlog ~^^ x) for e > cannot behave 
in the "random graph manner". On the other hand, by retracing the steps of the 
Proof of Theorem 14.21 for the case f{x) = l/(xlogx), where /i/^„ ~ loglogn, it is 
easy to see that the critical c is exactly 1 also in this case. 

In fact, / may be such that /i/^„ is any logarithm iteration, and the meth- 
ods of the proof still go through with little modification. The only problematic 
requirement is that initial coupling with a branching process should break down 
before we have seen log n vertices, which is the same as requiring that 



as n — > oo. This is of course true for all but the very most slowly growing sums, 
but it does not hold for instance for / such that hf^n = log* n. 

In light of this, not withstanding that proving it would require a different 
method, we feel motivated to make the following conjecture 

Conjecture 5.1. In the random graph created by using edge probabilites given 
by equations ^ and ^, c = 1 is the critical value for the emergence of a giant 
component if, and only if, hf^n — > cxd as n ^ oo. 
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6 Conclusion 

The a-model spans the spectrum from structure-free random graph models to or- 
dinary percolation. When a < 1, the connectivity results more or less mirror those 
of random graphs, whereas for greater values they behave more like percolation. 
We note that the cases where q < 1 are exactly those where pu,v — > as n ^ oo 
for all u ^ V. An interesting question is to further explore this territory and see 
if this property, under some regularity (perhaps monotonicity) requirements, is 
sufficient for "random graph" type behavior, or if there are cases where this holds, 
but where the critical value is not one. 
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